The following items have been submitted by the applicant or the IB to the United States Patent and Trademark 
Office as a Designated / Elected Office (37 CFR 1.495) 



• Priority Document 

• Copy of the International Application filed on 09/14/2005 

• Copy of the International Search Report filed on 09/14/2005 

• Copy of IPE Report filed on 09/14/2005 

• Preliminary Amendments filed on 09/14/2005 

• Biochemical Sequence Diskette filed on 09/14/2005 

• Oath or Declaration filed on 07/03/2006 

• Biochemical Sequence Listing filed on 11/13/2006 

• U.S. Basic National Fees filed on 09/14/2005 

• Priority Documents filed on 09/14/2005 

• Specification filed on 09/14/2005 

• Claims filed on 09/14/2005 

• Abstracts filed on 09/14/2005 

• Drawings filed on 09/14/2005 

• Paper nucleotide sequence listings filed on 09/14/2005 

Applicant's response filed 11/13/2006 is hereby acknowledged. The following requirements set forth in the 
NOTIFICATION of MISSING REQUIREMENTS mailed 05/02/2006 have not been completed. 

• The paper or compact disc copy of the "Sequence Listing" is not the same as the computer readable form 
of the "Sequence Listing" as required by 37 CFR 1.821(e). Applicant must provide a substitute paper or 
compact disc copy of the "Sequence Listing", as well as an amendment specifically directing its entry 
into the application OR a substitute computer readable form (CRF) copy of the "Sequence Listing". 
These two items must be the same. Applicant must also provide a statement that the content of the 
sequence listing information recorded in computer readable form is identical to the written (on paper or 
compact disc) sequence listing and, where applicable, includes no new matter, as required by 37 CFR 
1.821(e), 1.821(f), 1.821(g), 1.825(b), or 1.825(d). If the effective filing date is on or after September 8, 
2000, see the final rulemaking notice published in the Federal Register at 65 FR 54604 (September 8, 
2000) and 1238 OG 145 (September 19, 2000). 
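
As an informal aid for the identity requirement above, the following Python sketch compares the residues of two text copies of a sequence listing. It is only an illustration under stated assumptions: the function names are hypothetical, both copies are assumed to be available as plain text, and each sequence is assumed to start with a `<210>` line holding the bare SEQ ID number and to carry its residues after a `<400>` line. It is not a USPTO tool and does not replace the required statement.

```python
import re


def residues_by_sequence(listing_text):
    """Map each SEQ ID (the <210> value) to the residues found after <400>."""
    sequences = {}
    current = None
    in_residues = False
    for line in listing_text.splitlines():
        tag = re.match(r"\s*<(\d{3})>\s*(.*)", line)
        if tag:
            code, value = tag.group(1), tag.group(2).strip()
            if code == "210":
                current = value
                sequences[current] = ""
            # residues follow the <400> line and end at the next numeric identifier
            in_residues = code == "400"
            continue
        if in_residues and current is not None:
            # keep letters only, dropping spacing and position counts
            sequences[current] += re.sub(r"[^A-Za-z]", "", line).lower()
    return sequences


def differing_sequences(paper_text, crf_text):
    """Return the SEQ IDs whose residues differ between the two copies."""
    paper = residues_by_sequence(paper_text)
    crf = residues_by_sequence(crf_text)
    return sorted(seq for seq in paper.keys() | crf.keys()
                  if paper.get(seq) != crf.get(seq))
```

An empty result only means the residues match under these assumptions; differences in the numeric identifier fields (for example `<213>` or `<223>`) are not examined by this sketch.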
Applicant is required to complete the response within a time limit of ONE MONTH from the date of this 
Notification or within the time remaining in the response set forth in the Notification of Missing 
Requirements, whichever is the longer. No extension of this time limit may be granted under 37 CFR 
1.136, but the period for response set in the Notification of Missing Requirements may be extended under 
37 CFR 1.136(a). 

Applicant is cautioned that correction of the above items may cause the specification and drawings page count to 
exceed 100 pages. If the specification and drawings exceed 100 pages, applicant will need to submit the required 
application size fee. 

For questions regarding compliance to 37 CFR 1.821-1.825 requirements, please contact: 

• For Rules Interpretation, call (571) 272-0951 

• For PatentIn Software Program Help, call Patent EBC at 1-866-217-9197 or directly at 703-305-3028 / 
703-308-6845 between the hours of 6 a.m. and 12 midnight, Monday through Friday, EST. 

• Send e-mail correspondence for PatentIn Software Program Help to ebc@uspto.gov 

Applicant is reminded that any communications to the United States Patent and Trademark Office must be mailed 
to the address given in the heading and include the U.S. application no. shown above (37 CFR 1.5) 

Registered users of EFS-Web may alternatively submit their reply to this notice via EFS-Web. 
https://sportal.uspto.gov/authenticate/AuthenticateUserLocalEPF.html 

For more information about EFS-Web please call the USPTO Electronic Business Center at 1-866-217-9197 or 
visit our website at http://www.uspto.gov/ebc. 

If you are not using EFS-Web to submit your reply, you must include a copy of this notice. 

ANITA D JOHNSON 
FORM PCT/DO/EO/916 (371 Formalities Notice) 



STIC Biotechnology Systems Branch 
RAW SEQUENCE LISTING 



ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 10/549,352A 

Date Processed by STIC: 11/28/2006 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 
PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.4.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 
Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 
Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 
U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 
Alexandria, VA 22314 



Revised 01/10/06 



Raw Sequence Listing Error Summary 

ERROR DETECTED / SUGGESTED CORRECTION / SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

1 Wrapped Nucleics, Wrapped Aminos 
The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 

2 Invalid Line Length 
The rules require that a line not exceed 72 characters in length. This includes white spaces. 

3 Misaligned Amino Numbering 
The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

4 Non-ASCII 
The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text. 

5 Variable Length 
Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 
each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

6 PatentIn 2.0 "bug" 
A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid 
sequence(s). Normally, PatentIn would automatically generate this section from the 
previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

7 Skipped Sequences (OLD RULES) 
Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence: 
(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 
Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

8 Skipped Sequences (NEW RULES) 
Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence. 
<210> sequence id number 
<400> sequence id number 
000 

9 Use of n's or Xaa's (NEW RULES) 
Use of n's and/or Xaa's have been detected in the Sequence Listing. 
Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 
In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

10 
Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence. 

11 
Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

12 PatentIn 2.0 "bug" 
Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 

13 Misuse of n/Xaa 
"n" can only represent a single nucleotide; "Xaa" can only represent a single amino acid. 
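
Several of the rules in the error summary above are mechanical and can be pre-checked before a listing is submitted. The Python sketch below is a minimal illustration, not the USPTO Checker program: the function names are hypothetical, the input is assumed to be the applicant's own text file (numeric identifiers such as `<210>` followed by bare values, without the English headers added by PTO software), and only the 72-character, tab, ASCII, and `<213>`/`<223>` rules are covered.

```python
import re

MAX_LINE_LENGTH = 72  # limit quoted in the "Invalid Line Length" item

# a numeric identifier line such as "<213> Artificial Sequence"
TAG = re.compile(r"^\s*<(\d{3})>\s*(.*)$")


def check_format(text):
    """Return (line_number, message) pairs for length, tab and ASCII problems."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > MAX_LINE_LENGTH:  # white space counts toward the limit
            problems.append((number, "line exceeds 72 characters"))
        if "\t" in line:
            problems.append((number, "tab character found; use spaces"))
        if not line.isascii():
            problems.append((number, "non-ASCII character found"))
    return problems


def check_features(text):
    """Return SEQ IDs whose <213> is Artificial Sequence or Unknown
    but which have no non-empty <223> entry."""
    records = {}  # SEQ ID -> {"organism": str, "has_223": bool}
    current = None
    for line in text.splitlines():
        match = TAG.match(line)
        if not match:
            continue
        tag, value = match.group(1), match.group(2).strip()
        if tag == "210":
            current = value
            records[current] = {"organism": "", "has_223": False}
        elif current is not None and tag == "213":
            records[current]["organism"] = value
        elif current is not None and tag == "223" and value:
            records[current]["has_223"] = True
    needs_explanation = {"artificial sequence", "unknown"}
    return [seq for seq, rec in records.items()
            if rec["organism"].lower() in needs_explanation
            and not rec["has_223"]]
```

A sequence that passes `check_features` merely has some text in `<223>`; whether that text actually explains the source of the genetic material, as the rules require, still has to be judged by a person.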



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/549,352A 

DATE: 11/28/2006 
TIME: 14:28:06 

Input Set : B:\70237USPCT Second Substitute Seq List.txt 
Output Set: N:\CRF4\11282006\J549352A.raw 



3 <110> APPLICANT: DRAKE, Caroline Rachel 
4 PAINE, Jacqueline Ann Mary 
5 SHIPTON, Catherine Ann 
7 <120> TITLE OF INVENTION: Enhanced Accumulation of Carotenoids in Plants 
9 <130> FILE REFERENCE: 70237USPCT 
11 <140> CURRENT APPLICATION NUMBER: US 10/549,352A 
12 <141> CURRENT FILING DATE: 2005-09-14 
14 <150> PRIOR APPLICATION NUMBER: PCT/GB2004/001241 
15 <151> PRIOR FILING DATE: 2004-03-24 
17 <150> PRIOR APPLICATION NUMBER: US60/457,053 
18 <151> PRIOR FILING DATE: 2003-03-22 
20 <160> NUMBER OF SEQ ID NOS: 38 
22 <170> SOFTWARE: PatentIn version 3.2 
24 <210> SEQ ID NO: 1 
25 <211> LENGTH: 5630 
26 <212> TYPE: DNA 
27 <213> ORGANISM: Artificial Sequence 
29 <220> FEATURE: 
30 <223> OTHER INFORMATION: fl24 2*J^$X<Nvi b C yaOi^* 
32 <400> SEQUENCE: 1 

33 gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 
35 tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 120 
37 acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 180 
39 gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 240 
41 caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 300 
43 tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact catctacata 360 
45 agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 
47 gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac 480 
49 tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 540 
51 aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 
53 gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 
55 atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 
57 ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 
59 cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 840 
61 aattcggctt cccgggtaca gggtaaattt ctagtttttc tccttcattt tcttggttag 900 
63 gacccttttc tctttttatt tttttgagct ttgatctttc tttaaactga tctatttttt 960 
65 aattgattgg ttatcgtgta aatattacat agctttaact gataatctga ttactttatt 1020 
67 tcgtgtgtct ttgatcatct tgatagttac agaaccgtcg actctagaga agccatttaa 1080 
69 atcgccgcca ccatggcttc tatgatatcc tcttccgctg tgacaacagt cagccgtgcc 1140 
71 tctagggggc aatccgccgc agtggctcca ttcggcggcc tcaaatccat gactggattc 1200 
73 ccagtgaaga aggtcaacac tgacattact tccattacaa gcaatggtgg aagagtaaag 1260 
75 tgcatgaaac caactacggt aattggtgca ggcttcggtg gcctggcact ggcaattcgt 1320 
77 ctacaagctg cggggatccc cgtcttactg cttgaacaac gtgataaacc cggcggtcgg 1380 



79 gcttatgtct acgaggatca ggggtttacc tttgatgcag gcccgacggt tatcaccgat 1440 

81 cccagtgcca ttgaagaact gtttgcactg gcaggaaaac agttaaaaga gtatgtcgaa 1500 

83 ctgctgccgg ttacgccgtt ttaccgcctg tgttgggagt cagggaaggt ctttaattac 1560 

85 gataacgatc aaacccggct cgaagcgcag attcagcagt ttaatccccg cgatgtcgaa 1620 

87 ggttatcgtc agtttctgga ctattcacgc gcggtgttta aagaaggcta tctgaagctc 1680 

89 ggtactgtcc cttttttatc gttcagagac atgcttcgcg ccgcacctca actggcgaaa 1740 

91 ctgcaggcat ggagaagcgt ttacagtaag gttgccagtt acatcgaaga tgaacatctg 1800 

93 cgccaggcgt tttctttcca ctcgctgttg gtgggcggca atcccttcgc cacctcatcc 1860 

95 atttatacgt tgatacacgc gctggagcgt gagtggggcg tctggtttcc gcgtggcggc 1920 

97 accggcgcat tagttcaggg gatgataaag ctgtttcagg atctgggtgg cgaagtcgtg 1980 

99 ttaaacgcca gagtcagcca tatggaaacg acaggaaaca agattgaagc cgtgcattta 2040 
101 gaggacggtc gcaggttcct gacgcaagcc gtcgcgtcaa atgcagatgt ggttcatacc 2100 
103 tatcgcgacc tgttaagcca gcaccctgcc gcggttaagc agtccaacaa actgcagact 2160 
105 aagcgcatga gtaactctct gtttgtgctc tattttggtt tgaatcacca tcatgatcag 2220 
107 ctcgcgcatc acacggtttg tttcggcccg cgttaccgcg agctgattga cgaaattttt 2280 
109 aatcatgatg gcctcgcaga ggacttctca ctttatctgc acgcgccctg tgtcacggat 2340 
111 tcgtcactgg cgcctgaagg ttgcggcagt tactatgtgt tggcgccggt gccgcattta 2400 
113 ggcaccgcga acctcgactg gacggttgag gggccaaaac tacgcgaccg tatttttgcg 2460 
115 taccttgagc agcattacat gcctggctta cggagtcagc tggtcacgca ccggatgttt 2520 
117 acgccgtttg attttcgcga ccagcttaat gcctatcatg gctcagcctt ttctgtggag 2580 
119 cccgttctta cccagagcgc ctggtttcgg ccgcataacc gcgataaaac cattactaat 2640 
121 ctctacctgg tcggcgcagg cacgcatccc ggcgcaggca ttcctggcgt catcggctcg 2700 
123 gcaaaagcga cagcaggttt gatgctggag gatctgattt gaggccatgc aggccgatcc 2760 
125 ccgatcgttc aaacatttgg caataaagtt tcttaagatt gaatcctgtt gccggtcttg 2820 
127 cgatgattat catataattt ctgttgaatt acgttaagca tgtaataatt aacatgtaat 2880 
129 gcatgacgtt atttatgaga tgggttttta tgattagagt cccgcaatta tacatttaat 2940 
131 acgcgataga aaacaaaata tagcgcgcaa actaggataa attatcgcgc gcggtgtcat 3000 
133 ctatgttact agatcgggcc ttaataagct tgttaatcat ggtgtaggca acccaaataa 3060 
135 aacaccaaaa tatgcacaag gcagtttgtt gtattctgta gtacagacaa aactaaaagt 3120 
137 aatgaaagaa gatgtggtgt tagaaaagga aacaatatca tgagtaatgt gtgagcatta 3180 
139 tgggaccacg aaataaaaag aacattttga tgagtcgtgt atcctcgatg agcctcaaaa 3240 
141 gttctctcac cccggataag aaacccttaa gcaatgtgca aagtttgcat tctccactga 3300 
143 cataatgcaa aataagatat catcgatgac atagcaactc atgcatcata tcatgcctct 3360 
145 ctcaacctat tcattcctac tcatctacat aagtatcttc agctaaatgt tagaacataa 3420 
147 acccataagt cacgtttgat gagtattagg cgtgacacat gacaaatcac agactcaagc 3480 
149 aagataaagc aaaatgatgt gtacataaaa ctccagagct atatgtcata ttgcaaaaag 3540 
151 aggagagctt ataagacaag gcatgactca caaaaattca tttgcctttc gtgtcaaaaa 3600 
153 gaggagggct ttacattatc catgtcatat tgcaaaagaa agagagaaag aacaacacaa 3660 
155 tgctgcgtca attatacata tctgtatgtc catcattatt catccacctt tcgtgtacca 3720 
157 cacttcatat atcatgagtc acttcatgtc tggacattaa caaactctat cttaacattt 3780 
159 agatgcaaga gcctttatct cactataaat gcacgatgat ttctcattgt ttctcacaaa 3840 
161 aagcattcag ttcattagtc ctacaacaac gaattcggct tcccgggtac agggtaaatt 3900 
163 tctagttttt ctccttcatt ttcttggtta ggaccctttt ctctttttat ttttttgagc 3960 
165 tttgatcttt ctttaaactg atctattttt taattgattg gttatcgtgt aaatattaca 4020 
167 tagctttaac tgataatctg attactttat ttcgtgtgtc tttgatcatc ttgatagtta 4080 
169 cagaaccgtc gactctagag aagccattta aatcgccgcc accatggcca tcatactcgt 4140 
171 acgagcagcg tcgccggggc tctccgccgc cgacagcatc agccaccagg ggactctcca 4200 
173 gtgctccacc ctgctcaaga cgaagaggcc ggcggcgcgg cggtggatgc cctgctcgct 4260 
175 ccttggcctc cacccgtggg aggctggccg tccctccccc gccgtctact ccagcctgcc 4320 



177 cgtcaacccg gcgggagagg ccgtcgtctc gtccgagcag aaggtctacg acgtcgtgct 4380 
179 caagcaggcc gcattgctca aacgccagct gcgcacgccg gtcctcgacg ccaggcccca 4440 
181 ggacatggac atgccacgca acgggctcaa ggaagcctac gaccgctgcg gcgagatctg 4500 
183 tgaggagtat gccaagacgt tttacctcgg aactatgttg atgacagagg agcggcgccg 4560 
185 cgccatatgg gccatctatg tgtggtgtag gaggacagat gagcttgtag atgggccaaa 4620 
187 cgccaactac attacaccaa cagctttgga ccggtgggag aagagacttg aggatctgtt 4680 
189 cacgggacgt ccttacgaca tgcttgatgc cgctctctct gataccatct caaggttccc 4740 
191 catagacatt cagccattca gggacatgat tgaagggatg aggagtgatc ttaggaagac 4800 
193 aaggtataac aacttcgacg agctctacat gtactgctac tatgttgctg gaactgtcgg 4860 
195 gttaatgagc gtacctgtga tgggcatcgc aaccgagtct aaagcaacaa ctgaaagcgt 4920 
197 atacagtgct gccttggctc tgggaattgc gaaccaactc acgaacatac tccgggatgt 4980 
199 tggagaggat gctagaagag gaaggatata tttaccacaa gatgagcttg cacaggcagg 5040 
201 gctctctgat gaggacatct tcaaaggggt cgtcacgaac cggtggagaa acttcatgaa 5100 
203 gaggcagatc aagagggcca ggatgttttt tgaggaggca gagagagggg taactgagct 5160 
205 ctcacaggct agcagatggc cagtatgggc ttccctgttg ttgtacaggc agatcctgga 5220 
207 tgagatcgaa gccaacgact acaacaactt cacgaagagg gcgtatgttg gtaaagggaa 5280 
209 gaagttgcta gcacttcctg tggcatatgg aaaatcgcta ctgctcccat gttcattgag 5340 
211 aaatggccag acctagggcc atgcaggccg atccccgatc gttcaaacat ttggcaataa 5400 
213 agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata atttctgttg 5460 
215 aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat gagatgggtt 5520 
217 tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa aatatagcgc 5580 
219 gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg 5630 

222 <210> SEQ ID NO: 2 
223 <211> LENGTH: 5630 
224 <212> TYPE: DNA 
225 <213> ORGANISM: Artificial Sequence 
227 <220> FEATURE: 
228 <223> OTHER INFORMATION: 12*21 \j^~ x1 J&A XtOS/S^^ 
230 <400> SEQUENCE: 2 

231 gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 
233 tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 120 
235 acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 180 
237 gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 240 
239 caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 300 
241 tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact catctacata 360 
243 agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 
245 gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac 480 
247 tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 540 
249 aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 
251 gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 
253 atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 
255 ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 
257 cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 840 
259 aattcggctt cccgggtaca gggtaaattt ctagtttttc tccttcattt tcttggttag 900 
261 gacccttttc tctttttatt tttttgagct ttgatctttc tttaaactga tctatttttt 960 
263 aattgattgg ttatcgtgta aatattacat agctttaact gataatctga ttactttatt 1020 
265 tcgtgtgtct ttgatcatct tgatagttac agaaccgtcg actctagaga agccatttaa 1080 
267 atcgccgcca ccatggcttc tatgatatcc tcttccgctg tgacaacagt cagccgtgcc 1140 
269 tctagggggc aatccgccgc agtggctcca ttcggcggcc tcaaatccat gactggattc 1200 
271 ccagtgaaga aggtcaacac tgacattact tccattacaa gcaatggtgg aagagtaaag 1260 

273 tgcatgaaac caactacggt aattggtgca ggcttcggtg gcctggcact ggcaattcgt 1320 

275 ctacaagctg cggggatccc cgtcttactg cttgaacaac gtgataaacc cggcggtcgg 1380 

277 gcttatgtct acgaggatca ggggtttacc tttgatgcag gcccgacggt tatcaccgat 1440 

279 cccagtgcca ttgaagaact gtttgcactg gcaggaaaac agttaaaaga gtatgtcgaa 1500 

281 ctgctgccgg ttacgccgtt ttaccgcctg tgttgggagt cagggaaggt ctttaattac 1560 

283 gataacgatc aaacccggct cgaagcgcag attcagcagt ttaatccccg cgatgtcgaa 1620 

285 ggttatcgtc agtttctgga ctattcacgc gcggtgttta aagaaggcta tctgaagctc 1680 

287 ggtactgtcc cttttttatc gttcagagac atgcttcgcg ccgcacctca actggcgaaa 1740 

289 ctgcaggcat ggagaagcgt ttacagtaag gttgccagtt acatcgaaga tgaacatctg 1800 

291 cgccaggcgt tttctttcca ctcgctgttg gtgggcggca atcccttcgc cacctcatcc 1860 

293 atttatacgt tgatacacgc gctggagcgt gagtggggcg tctggtttcc gcgtggcggc 1920 

295 accggcgcat tagttcaggg gatgataaag ctgtttcagg atctgggtgg cgaagtcgtg 1980 

297 ttaaacgcca gagtcagcca tatggaaacg acaggaaaca agattgaagc cgtgcattta 2040 

299 gaggacggtc gcaggttcct gacgcaagcc gtcgcgtcaa atgcagatgt ggttcatacc 2100 

301 tatcgcgacc tgttaagcca gcaccctgcc gcggttaagc agtccaacaa actgcagact 2160 



303 aagcgcatga gtaactctct gtttgtgctc tattttggtt tgaatcacca tcatgatcag 2220 

305 ctcgcgcatc acacggtttg tttcggcccg cgttaccgcg agctgattga cgaaattttt 2280 

307 aatcatgatg gcctcgcaga ggacttctca ctttatctgc acgcgccctg tgtcacggat 2340 

309 tcgtcactgg cgcctgaagg ttgcggcagt tactatgtgt tggcgccggt gccgcattta 2400 

311 ggcaccgcga acctcgactg gacggttgag gggccaaaac tacgcgaccg tatttttgcg 2460 

313 taccttgagc agcattacat gcctggctta cggagtcagc tggtcacgca ccggatgttt 2520 



315 acgccgtttg attttcgcga ccagcttaat gcctatcatg gctcagcctt ttctgtggag 2580 

317 cccgttctta cccagagcgc ctggtttcgg ccgcataacc gcgataaaac cattactaat 2640 

319 ctctacctgg tcggcgcagg cacgcatccc ggcgcaggca ttcctggcgt catcggctcg 2700 

321 gcaaaagcga cagcaggttt gatgctggag gatctgattt gaggccatgc aggccgatcc 2760 

323 ccgatcgttc aaacatttgg caataaagtt tcttaagatt gaatcctgtt gccggtcttg 2820 



325 cgatgattat catataattt ctgttgaatt acgttaagca tgtaataatt aacatgtaat 2880 

327 gcatgacgtt atttatgaga tgggttttta tgattagagt cccgcaatta tacatttaat 2940 

329 acgcgataga aaacaaaata tagcgcgcaa actaggataa attatcgcgc gcggtgtcat 3000 

331 ctatgttact agatcgggcc ttaataagct tgttaatcat ggtgtaggca acccaaataa 3060 



333 aacaccaaaa tatgcacaag gcagtttgtt gtattctgta gtacagacaa aactaaaagt 3120 

335 aatgaaagaa gatgtggtgt tagaaaagga aacaatatca tgagtaatgt gtgagcatta 3180 

337 tgggaccacg aaataaaaag aacattttga tgagtcgtgt atcctcgatg agcctcaaaa 3240 

339 gttctctcac cccggataag aaacccttaa gcaatgtgca aagtttgcat tctccactga 3300 

341 cataatgcaa aataagatat catcgatgac atagcaactc atgcatcata tcatgcctct 3360 



343 ctcaacctat tcattcctac tcatctacat aagtatcttc agctaaatgt tagaacataa 3420 

345 acccataagt cacgtttgat gagtattagg cgtgacacat gacaaatcac agactcaagc 3480 

347 aagataaagc aaaatgatgt gtacataaaa ctccagagct atatgtcata ttgcaaaaag 3540 

349 aggagagctt ataagacaag gcatgactca caaaaattca tttgcctttc gtgtcaaaaa 3600 

351 gaggagggct ttacattatc catgtcatat tgcaaaagaa agagagaaag aacaacacaa 3660 

353 tgctgcgtca attatacata tctgtatgtc catcattatt catccacctt tcgtgtacca 3720 

355 cacttcatat atcatgagtc acttcatgtc tggacattaa caaactctat cttaacattt 3780 

357 agatgcaaga gcctttatct cactataaat gcacgatgat ttctcattgt ttctcacaaa 3840 

359 aagcattcag ttcattagtc ctacaacaac gaattcggct tcccgggtac agggtaaatt 3900 

361 tctagttttt ctccttcatt ttcttggtta ggaccctttt ctctttttat ttttttgagc 3960 

363 tttgatcttt ctttaaactg atctattttt taattgattg gttatcgtgt aaatattaca 4020 

365 tagctttaac tgataatctg attactttat ttcgtgtgtc tttgatcatc ttgatagtta 4080 

367 cagaaccgtc gactctagag aagccattta aatcgccgcc accatggcca tcatactcgt 4140 
369 acgagcagcg tcgccggggc tctccgccgc cgacagcatc agccaccagg ggactctcca 4200 
371 gtgctccacc ctgctcaaga cgaagaggcc ggcggcgcgc cggtggatgc cctgctcgct 4260 
373 ccttggcctc cacccgtggg aggctggccg tccctccccc gccgtctact ccagcctcgc 4320 
375 cgtcaacccg gcgggagagg ccgtcgtctc gtccgagcag aaggtctacg acgtcgtgct 4380 
377 caagcaggcc gcattgctca aacgccagct gcgcacgccg gtcctcgacg ccaggcccca 4440 
379 ggacatggac atgccacgca acgggctcaa ggaagcctac gaccgctgcg gcgagatctg 4500 
381 tgaggagtat gccaagacgt tttacctcgg aactatgttg atgacagagg agcggcgccg 4560 
383 cgccatatgg gccatctatg tgtggtgtag gaggacagat gagcttgtag atgggccaaa 4620 
385 cgccaactac attacaccaa cagctttgga ccggtgggag aagagacttg aggatctgtt 4680 
387 cacgggacgt ccttacgaca tgcttgatgc cgctctctct gataccatct caaggttccc 4740 
389 catagacatt cagccattca gggacatgat tgaagggatg aggagtgatc ttaggaagac 4800 
391 aaggtataac aacttcgacg agctctacat gtactgctac tatgttgctg gaactgtcgg 4860 
393 gttaatgagc gtaccagtga tgggcatcgc atccgagtct aaagcaacaa ctgaaagcgt 4920 
395 gtacagtgct gccttggctc tcggaattgc gaaccaactc acgaacatac tccgggatgt 4980 
397 tggagaggat gctagacgag gaaggatata tttaccacaa gatgagcttg cacaggcagg 5040 
399 gctctctgat gaggacatct tcaaaggggt cgtcacgaac cggtggagaa acttcatgaa 5100 
401 gaggcagatc aagagggcca ggatgttttt tgaggaggca gagagagggg taactgagct 5160 
403 ctcacaggct agcagatggc cagtatgggc ttccctgttg ttgtacaggc agatcctgga 5220 
405 tgagatcgaa gccaacgact acaacaactt cacgaagagg gcgtatgttg gtaaagggaa 5280 
407 gaagttgcta gcacttcctg tggcatatgg aaaatcgcta ctgctcccat gttcattgag 5340 
409 aaatggccag acctagggcc atgcaggccg atccccgatc gttcaaacat ttggcaataa 5400 
411 agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata atttctgttg 5460 
413 aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat gagatgggtt 5520 
415 tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa aatatagcgc 5580 
417 gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg 5630 

420 <210> SEQ ID NO: 3 
421 <211> LENGTH: 5180 
422 <212> TYPE: DNA 
423 <213> ORGANISM: Artificial Sequence 
425 <220> FEATURE: 
426 <223> OTHER INFORMATION: /l2 42 2 J i ^T^ V ^^ 
428 <400> SEQUENCE: 3 
429 gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 
431 tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 120 
433 acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 180 
435 gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 240 
437 caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 300 
439 tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact catctacata 360 
441 agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 
443 gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac 480 
445 tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 540 
447 aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 
449 gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 
451 atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 
453 ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 
455 cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 840 
457 aattcggctt cccaaatcgc cgccaccatg gcttctatga tatcctcttc cgctgtgaca 900 
459 acagtcagcc gtgcctctag ggggcaatcc gccgcagtgg ctccattcgg cggcctcaaa 960 
461 tccatgactg gattcccagt gaagaaggtc aacactgaca ttacttccat tacaagcaat 1020 